M = Syntax + Prosody: A syntactic-prosodic labelling scheme for large spontaneous speech databases
نویسندگان
چکیده
In automatic speech understanding, division of continuous running speech into syntactic chunks is a great problem. Syntactic boundaries are often marked by prosodic means. For the training of statistical models for prosodic boundaries large databases are necessary. For the German Verbmobil (VM) project (automatic speech-to-speech translation), we developed a syntactic±prosodic labelling scheme where dierent types of syntactic boundaries are labelled for a large spontaneous speech corpus. This labelling scheme is presented and compared with other labelling schemes for perceptual±prosodic, syntactic, and dialogue act boundaries. Interlabeller consistencies and estimation of eort needed are discussed. We compare the results of classi®ers (multi-layer perceptrons (MLPs) and n-gram language models) trained on these syntactic±prosodic boundary labels with classi®ers trained on perceptual±prosodic and pure syntactic labels. The main advantage of the rough syntactic±prosodic labels presented in this paper is that large amounts of data can be labelled with relatively little eort. The classi®ers trained with these labels turned out to be superior with respect to purely prosodic or syntactic labelling schemes, yielding recognition rates of up to 96% for the two-class-problem `boundary versus no boundary'. The use of boundary information leads to a marked improvement in the syntactic processing of the VM system. Ó 1998 Elsevier Science B.V. All rights reserved.
منابع مشابه
Prosody in a corpus of French spontaneous speech: perception, annotation and prosody ~ syntax interaction
Our study focuses on the issue of prosodic annotation and of the prosody ~ syntax interface in conversation and is based on a large corpus of conversational speech in French. The results of inter-transcriber agreement tests show that two expert transcribers are consistent in their labeling of prosodic phrasing and the consistency is well above the chance. A qualitative analysis reveals transcri...
متن کاملSLAM: Automatic Stylization and Labelling of Speech Melody
This paper presents SLAM : a simple method for the automatic Stylization and LAbelling of speech Melody. This main contributions over existing methods are : the alphabet of melodic contours is fully data-driven, an explicit time-frequency representation is used to derive complex melodic contours, and melodic contours can be determined over arbitrary prosodic/syntactic units. Additionally, the s...
متن کاملrosody/Parse Scoring and its Application in ATIS
Prosodic patterns provide important cues for resolving syntactic ambiguity, and might be used to improve the accuracy of automatic speech understanding. With this goal, we propose a method of scoring syntactic parses in terms of observed prosodic cues, which can be used in ranking sentence hypotheses and associated parses. Speciically, the score is the probability of acoustic features of a hypo...
متن کاملMapping Syntax and Prosody
The relationship between prosodic structure and syntactic structure has remained a controversial and unresolved area, partly due to the lack of rich corpora of natural speech, and partly due to the complexity involved in both syntax and prosody. Chom-sky & Halle (1968, p. 372) state that " although there is a substantial literature on intonational and prosodic features in English, it is largely...
متن کاملBootstrapping the syntactic bootstrapper: Probabilistic labelling of prosodic phrases
The syntactic bootstrapping hypothesis proposes that syntactic structure provides children with cues for learning the meaning of novel words. In this paper, we address the question of how children might start acquiring some aspects of syntax before they possess a sizeable lexicon. The study presents two models of early syntax acquisition that rest on three major assumptions grounded in the infa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Speech Communication
دوره 25 شماره
صفحات -
تاریخ انتشار 1998